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Abstract. LeoPARD supports the implementation of knowledge repre¬ 
sentation and reasoning tools for higher-order logic(s). It combines a so¬ 
phisticated data structure layer (polymorphically typed A-calculus with 
nameless spine notation, explicit substitutions, and perfect term shar¬ 
ing) with an ambitious multi-agent blackboard architecture (supporting 
prover parallelism at the term, clause, and search level). Further features 
of LeoPARD include a parser for all TPTP dialects, a command line 
interpreter, and generic means for the integration of external reasoners. 


1 Introduction 

LeoPARD (Leo’s Parallel ARchitecture and Datastructures) is designed as a 
generic system platform for implementing higher-order (HO) logic based knowl¬ 
edge representation, and reasoning tools. In particular, LeoPARD provides the 
base layer of the new HO automated theorem prover (ATP) Leo-III, the suc¬ 
cessor of the well known provers LEO- I [U and LEO-II [7]. 

Previous experiments with LEO-I and the OAnts mechanism indicate 
a flexible, multi-agent blackboard architecture is well-suited for automating HO 
logic [6]. However, (due to project constraints) such an approach has not been 
realized in LEO-II. Instead, the focus has been on the proof search layer in 
combination with a simple, sequential collaboration with an external first-order 
(FO) ATP. LEO-II also provides improved term data structures, term indexing, 
and term sharing mechanisms, which unfortunately have not been optimally ex¬ 
ploited at the clause and the proof search layer. For the development of Leo-III 
the philosophy therefore has been to allocate sufficient resources for the ini¬ 
tial development of a flexible and reusable system platform. The goal has been 
to bundle, improve, and extend the features with the highest potential of the 
predecessor systems LEO-I, LEO-II and OAnts. 

The result of this initiative is LeoPARE 0, which is written in Scala and 
currently consists of approx. 13000 lines of code. LeoPARD combines a sophis¬ 
ticated data structure layer (polymorphically typed A-calculus with nameless 
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spine notation, explicit substitutions, and perfect term sharing), with a multi¬ 
agent blackboard architecture [IS] (supporting prover parallelism at the term, 
clause, and search level) and further tools including a parser for all TPTP [22123] 
syntax dialects, generic support for interfacing with external reasoners, and a 
command line interpreter. Such a combination of features and support tools is, 
up to the authors knowledge, not matched in related HO reasoning frameworks. 

The intended users of the LeoPARD package are implementors of HO knowl¬ 
edge representation and reasoning systems, including novel ATPs and model 
finders. In addition, we advocate the system as a platform for the integration 
and coordination of heterogeneous (external) reasoning tools. 


2 Term Data Structure 

Data structure choices are a critical part of a theorem prover and permit reliable 
increases of overall performance when implemented and exploited properly. Key 
aspects for efficient theorem proving have been an intensive research topic and 
have reached maturity within FO-ATPs [19120] . Naturally, one would expect an 
even higher impact of the data structure choices in HO-ATPs. However, in the 
latter context, comparably little effort has been invested yet - probably also 
because of the inherently more complex nature of HO logic. 

Term Language. The LeoPARD term language extends the simply typed A- 
calculus with parametric polymorphism, yielding the second-order polymor- 
phically typed A-calculus (corresponding to A2 in Barendregt’s A-cube [3]). 
In particular, the system under consideration was independently developed by 
Reynolds [16] and Girard [14] and is commonly called System F today. Further 
extensions, for example to admit dependent types, are future work. 

Thus, LeoPARD supports the following type and term language: 

T,v ::= t G T (Base type) 

I a (Type variable) 

\ T ^ 1/ (Abstraction type) 

I Va. T (Polymorphic type) 

S,t ::= Xr G Vt \ Cr G S (Variable / Constant) 

I {\Xr Si/)r->jy | (sr-s-iv tr)i^ (Term abstr. / appl.) 

I {Aa Sr)\/a T I (sva r v)r[a/y] (Type abstr. / appl.) 

An example term of this language is: 

AaXPa^o ((/v/3 {p^o)^o^o a) {XYa P V)) To. 

Nameless Representation. Internally, LeoPARD employs a locally nameless rep¬ 
resentation (both at the type and term level), that extends de-Bruijn indices to 
(bound) type variables [15]. The definition of de-Bruijn indices [11] for type vari¬ 
ables is analogous to the one for term variables. Thus, the above example term 






is represented namelessly as 

{AX,^o ((/v(i^o)^o^c 1) (Ai 2 1)) To) 

where de-Bruijn indices for type variables are underlined. 

Spine Notation and Explicit Substitutions. On top of nameless terms, LeoPARD 
employs spine notation [12] and explicit substitutions [1]. The first technique 
allows quick head symbol queries, and efficient left-to-right traversal, e.g. for 
unification algorithms. The latter augments the calculus with substitution clo¬ 
sures that admit efficient (partial) /3-normalization runs. Internally, the above 
example reads 

/v(i^o)^o^o ■ (1; Ai 2 • (1); T) 

where • combines function heads to argument lists (spines) in which ; denotes 
concatenation of arguments. 

Term Sharing/Indexing. Terms are perfectly shared within LeoPARD, mean¬ 
ing that each term is only constructed once and then reused between different 
occurrences. This does not only reduce memory consumption in large knowledge 
bases, but also allows constant-time term comparison for syntactic equality us¬ 
ing the term’s pointer to its unique physical representation. For fast (sub-)term 
retrieval based on syntactical criteria (e.g. head symbols, subterm occurrences, 
etc.) from the term indexing mechanism, terms are kept in /3-normal 77 -long form. 

Suite of Normalization Strategies. LeoPARD comes with a number of differ¬ 
ent (heuristic) /3-normalization strategies that adjust the standard leftmost- 
outermost strategy with different combinations of strict and lazy substitution 
composition resp. normalization and closure construction. 77 -normalization is in¬ 
variant wrt. /3-normalization of spine terms and hence 77 -normalization (to long 
form) is applied only once for each freshly created term. 

Evaluation and Findings. A recent empirical evaluation m has shown that there 
is no single best reduction strategy for HO-ATPs. More precisely, for different 
TPTP problem categories this study identified different best reduction strategies. 
This motivates future work in which machine learning techniques could be used 
to suggest suitable strategies. 

3 Multi-agent Blackboard Architecture 

In addition to supporting classical, sequential theorem proving procedures, 
LeoPARD offers means for breaking the global ATP loop down into a set of sub¬ 
tasks that can be computed in parallel. This also includes support for subprover 
parallelism as successfully employed, for example, in Isabelle/HOL’s Sledgeham¬ 
mer tool [S]. More generally, LeoPARD is construed to enable parallalism at 
various levels inside an ATP, including the term, clause, and search level [9] . For 
this, LeoPARD provides a flexible multi-agent blackboard architecture. 


Blackboard Architecture. Process communication in LeoPARD is realized indi¬ 
rectly via a blackboard architecture [53]. The LeoPARD blackboard |5S] is a 
collection of globally shared and accessible data structures which any process, 
i.e. agent, can query and manipulate at any time in parallel. From the black¬ 
board’s perspective each process is a specialist responsible for exactly one kind 
of problem. The blackboard is generic in the data structures, i.e. it allows the 
programmer to add various kinds data structures for any kind of data. Insertion 
into the data structures is handled by the blackboard. Hence, each specialist can 
indeed by specialized on a single data structure. 

The LeoPARD blackboard mechanism and associated data structures pro¬ 
vide specific support for nested and-or search trees, meaning that sets of formulae 
can be split into (nested) and-or contexts. Moreover, for each supercontext re¬ 
spective TPTP SZS status [55] information is automatically inferred from the 
statuses of its subcontexts. 

Agents. In LeoPARD specialist processes can be modeled as agents [53]. Clas¬ 
sically, agents are composed of three components: environment perception, de¬ 
cision making, and action execution [24] . 

The perception of LeoPARD agents is trigger-based, meaning that each 
agent is notified by a change in the blackboard. LeoPARD agents are to be 
seen as homomorphisms on the blackboard data together with a filter when to 
apply an action. Depending on the perceived change of the resp. state of the 
blackboard an agent decides on an action it wants to execute. 

Auction Scheduler. Action execution in LeoPARD is coordinated by an auction 
based scheduler, which implements an own approximation algorithm [25] for 
combinatorical auctions [2]. More precisely, each LeoPARD agent computes 
and places a bid for the execution of its action(s). The auction based scheduler 
then tries to maximize the global benefit of the particular set of actions to choose. 

This selection mechanism works uniformly for all agents that can be imple¬ 
mented in LeoPARD. Balancing the value of the actions is therefore crucial for 
the performance and the termination of the overall system. A possible generic so¬ 
lution for the agents bidding is to apply machine learning techniques to optimize 
the bids for the best overall performance. This is future work. 

Note that the use of advanced agent technology in LeoPARD is optional. 
A traditional ATP can still be implemented, for example, as a single, sequential 
reasoner instantiating exactly one agent in the LeoPARD framework. 

Agent Implementation Examples. For illustration purposes, some agent imple¬ 
mentations have been exemplarily included in the LeoPARD package. For ex¬ 
ample, simple agents for simplification, skolemization, prenex-form, negation- 
normal-form and paramodulation are provided. Moreover, the agent-based inte¬ 
gration of external ATPs is demonstrated and their parallelization is enabled by 
the LeoPARD agent framework. This includes agents embodying LEO-II and 
Satallax [10] running remotely on the SystemOnTPTP [22] servers in Miami. 
These example agents can be easily adapted for other TPTP compliant ATPs. 


Each example agent comes with an applicability filter, an action definition 
and an auction value computation. The provided agents suffice to illustrate the 
working principles of the LeoPARD multi-agent blackboard architecture to in¬ 
terested implementors. After the official release of Leo-III, further, more so¬ 
phisticated agents will be included and offered for academic reuse. 

4 Other Components 

The Leopard framework provides useful further components. For example, a 
generic parser is provided that supports all TPTP syntax dialects. Moreover, a 
command line interpreter supports fine grained interaction with the system. This 
is useful not only for debugging but also for training and demonstration purposes. 
As pointed at above, useful support is also provided for the integration of external 
reasoners based on the TPTP infrastructure. This also includes comprehensive 
support for the TPTP SZS result ontology. Moreover, ongoing and future work 
aims at generic means for the transformation and integration of (external) proof 
protocols, ideally by exploiting results of projects such as ProofCer10. 

5 Related work 

There is comparably little related work to LeoPARD, since higher-order the¬ 
orem provers typically implement their own data structures. Related systems 
(mostly concerning term representation) include AProlog and Teyjus [17) . the 
Abella interactive theorem prover m, and the logical framework Twelf m- 
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